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We provide a non-equilibrium thermodynamic description of the life-cycle of a droplet based, 
chemically feasible, system of protocells. By coupling the protocells metabolic kinetics with its 
thermodynamics, we demonstrate how the system can be driven out of equilibrium to ensure pro¬ 
tocell growth and replication. This coupling allows us to derive the equations of evolution and to 
rigorously demonstrate how growth and replication life-cycle can be understood as a non-equilibrium 
thermodynamic cycle. The process does not appeal to genetic information or inheritance, and is 
based only on non-equilibrium physics considerations. Our non-equilibrium thermodynamic descrip¬ 
tion of simple, yet realistic, processes of protocell growth and replication, represents an advance in 
our physical understanding of a central biological phenomenon both in connection to the origin of 
life and for modern biology. 


Introduction.- Developing a physical understanding of 
the processes underlying biological phenomena is perhaps 
one of the greatest challenges for modern physics. In 
this work we uncover the underpinning thermodynam¬ 
ics of cell growth and division in protocells composed 
of surfactant coated oil droplets in water. Our objec¬ 
tive is twofold: on the one hand, we try to disentangle 
the physical conditions for growth and division processes 
that are critical for all life. On the other hand, an un¬ 
derstanding of these phenomena may assist the ongoing 
work on assembling artificial cells in the laboratory, with 
a large number of potential technological applications. 
Finally, we believe that a thermodynamic understanding 
of cell growth and division in one of its simplest imple¬ 
mentations may facilitate our understanding of the more 
complex processes of modern cell division. 

To bypass a discussion of the controversial topic about 
“what is life”, we use an operational definition of a living 
process as a physical entity w that has the ability: (i) 
to capture material resources and turn them into build¬ 
ing blocks (grow and divide) by the use of external pro¬ 
vided free energy (a metabolic machinery); Hereby the 
system is driven out of equilibrium and should undergo 
a thermodynamic cycles every time it replicates 0; (ii) 
to process, in part by controlling the metabolic processes 
under (i), and transmit (copy) inheritable information 
to progeny; (iii) to keep its components together and 
distinguish itself from the environment (compartmental- 
ization). The compartment contains the metabolic and 
the informational system; (iv) to undergo Darwinian evo¬ 
lution through variation of the copied inheritable infor¬ 
mation and a successive selection of the better progeny. 
Important advances have been made over the years re¬ 
garding the thermodynamics of living processes [am- 
Recently, additional advances on this topic has emerged, 
relating thermodynamics, information and the essential 
chemical reactions in living systems [HHII]- 


Here we present a non-equilibrium thermodynamic 
characterization by a system of protocells that are able to 
reproduce. We device a physical self-replicating system 
with an energy transduction mechanism that converts 
chemical energy into mechanical energy that drives the 
aggregate division. We address how the protocell stabil¬ 
ity is obtained in and out of equilibrium and how an insta¬ 
bility is used to drive the protocellular self-replication. In 
addition, the system is designed under realistic conditions 
to lead and reflect laboratory experiments in this area. 
Similar protocellular life-cycle systems have already been 
implemented in the lab both based on droplets and vesi¬ 
cles [H US]. The system can be realized without any 
use of inheritable genetic information. We thus have a 
system capable of reproducing (under external supply of 
chemical energy and matter) satisfying minimally condi¬ 
tions (i) and (iii) stated above. 

As model system for simple replicating protocells we 
study oil-in-water emulsion compartments—ternary sys¬ 
tems where a surfactant layer shields small hydropho¬ 
bic volumes of oil molecules from their aqueous environ¬ 
ment (i.e. Winsor type IV emulsion). The amphipihlic 
surfactants lower the surface tension of the oil-water in¬ 
terface to a degree where suspended spherical droplets 
are thermodynamically stable due to the increase of mix¬ 
ing entropy in the system—see Figure [^a-b). Surfactant 
molecules and, to a lesser extent, oil molecules will also 
be found in aqueous solution as this again increases the 
mixing entropy of the system. We will refer to these 
compartments as oil droplets or simply droplets. These 
droplets already mimic one of the fundamental properties 
of living beings, namely, property (iii), since it can act 
as a container for a metabolism and an information sys¬ 
tem and thus defines the boundary of the living system. 
Emulsion compartments have been proposed previously 
as containers for experimental models of living systems 
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FIG. 1: (a) Our system is composed of two types of molecules in an aqueous solution: decanoic anhydrides (orange) and 

decanoic acid (blue), (b) In aqueous solutions, the conic geometry of decanoic acid molecules (also referred to as surfactants or 
lipids) promotes aggregation of spherical clusters, due to the hydrophilic behaviour of the head and the hydrophobic behaviour of 
the tail. The surfactant layer shields an encapsulated volume of decanoic anhydride, also referred to as precursors. This creates 
a primitive compartment. Notice that the cover of the surface might not be perfect, (c) A molecule of anhydride spontaneously 
hydrolyzes two molecules of decanoic acid (blue) at a certain rate km- This creates an extra availability of surface that deforms 
and eventually breaks the aggregates, (d) Experimental setup: a reactor contains an emulsion with droplets composed of 
internal precursor molecules surrounded by lipids. The reactor is constantly fed with precursors that incorporate into existing 
droplets. The spontaneous metabolic reaction converts precursors into lipids, thus changing the droplets’ surface to volume 
ratio until they become unstable and divide. An outflow at the bottom removes newly created aggregates and waste, leaving 
the total aggregate density constant, (e) A ’zoom’ into a single replication cycle: the chemical gradient of the input precursor 
molecules is the energetic inflow, whereas the newly created aggregate is the outcome of the cycle. 


We equip this model system with a simple metabolism 
by choosing an oil component that can be converted into 
surfactants. Several such metabolisms have been pro¬ 
posed based on hydrolysis or photo-fragmentation of or¬ 
ganic acid esters and anhydrides at the water-aggregate 
interface mUMIH]. Compared to lipid vesicles with en¬ 
capsulated metabolisms, the oil phase of emulsion com¬ 
partments (as well as the exterior of a lipid membrane 
container) does not introduce a diffusive barrier to nutri¬ 
ent and waste fluxes toward and from the droplet inter¬ 
face. Common to all these model metabolisms is that 
the hydrophobic compound serves as (in some of the 
above cases high-energy) nutrient that gets catabolized 
into an amphiphilic (in some of the above cases low en¬ 
ergy) building block of the compartment plus a potential 
waste molecule. For simplicity, but without loss of gen¬ 
erality in our approach, we pick anhydride hydroplysis 
as the metabolic reaction as it avoids the introduction of 
a metabolic system with associated additional chemical 
species in the system. In particular, we base our calcula¬ 
tions on a system composed of decanoic acid surfactants 
C9H19COOH and decanoic anhydride precursors. The 
metabolic reaction is then given as 

C9Hi9C(0)0C(0)CgHi9 -b H2O 2C9H19COOH, 

( 1 ) 

where the anhydride can be regarded as food molecule 


and the surfactant as building blocks. The metabolic re¬ 
action continuously converts the hydrophobic volume of 
the droplets into new surfactants. As the nutrient is hy¬ 
drophobic but depends on the availability of water, we 
expect the reaction to take place at the compartment in¬ 
terface. As a result of the changing surface to volume 
ratio, the aggregate will eventually become unstable and 
divide into smaller compartments, thereby accommodat¬ 
ing for the newly produced surface molecules [T9H2T]. 

We now investigate a maintained aggregate feeding- 
division cycle and show that it has all the properties 
needed to be considered a non-equilibrium thermody¬ 
namic cycle, as it fulfils conditions (i) and (iii), while in¬ 
formation replication (ii) and evolution (iv) are not part 
of this system. 

Thermodynamic landscape of the system.- A state, a, in 
this non-equilibrium thermodynamic system is described 
by five state variables a = (Ld, Pd,Lb,Ph,n) € N® which 
denote the number of surfactants L and surfactant pre¬ 
cursors P arranged into n oil droplets as well as in the 
bulk environment and Pb. Alternatively, we write 

(T = (L, P, “ (^d, Pd, Tb, Pb, n) (2) 

to emphasize constant total numbers Ptot = Pd + Pb and 
Ptot = Pd + Pb (and we also simplify L = Ld, P — Pd). 
We assume that the oil droplet can exchange particles, 
heat, and pressure with the environment (NPT ensemble) 
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and that transfer of heat and pressure occur significantly 
faster than transfer of matter such that the system is 
instantaneously equilibrated with respect to temperature 
and pressure. 

Aside from entropic contributions to be specified be¬ 
low, the change in (Gibbs) free energy associated with 
self-assembly of a droplet emulsion compartment from 
solution can be decomposed into the three components: 

Gdrop = A/XlA -|- A/XpP -|- Ggeo- (3) 

where A/rp and A/ip are the changes in chemical poten¬ 
tial when moving precursors and lipids from bulk into the 
aggregate, and Ggeo a geometric term expressing shape 
and surface contributions of the aggregate. A/ip can be 
calculated from their partition coefficient— i.e. the frac¬ 
tion of lipids found in bulk solution as opposed to the 
aggregates. Bachman et al. estimate this value to be 
14% for surfactants with comparable solubility [35]. At 
T = 300K, this corresponds to A/tp = —4.53kJ/mol. 
Since the anhydride has two hydrophobic chains, we set 
A/ip = 2A/ip = —9.06kJ/mol, which in turn evaluates 
to a partition coefficient of 2.5%. 

To compute the geometric contribution to the en¬ 
ergy, we observe that if the same principle of opposing 
forces [33] that dictates self-assembly of micelles will also 
drive assembly of droplet compartments, and if bending 
elastic contributions are ignored, the geometric contribu¬ 
tion to the free energy reads (see supplementary infor¬ 
mation, SI): 


Ggeo — 7® + (4) 

a 

where 7 is the surface tension, j3 the compressibility co¬ 
efficient, and a the surface area of the compartment. In 
the absence of precursor, the minimum of Ggeo where the 
opposing forces balance corresponds to oq = \fWH- As 
a function of lipid molecules, Gceo can be expanded as 
(see SI): 


Ggeo (A) ~ Gg 



(L-L*f 


(5) 


The surface tension parameter 7 can be evaluated from 
Langmuir trough measurements and equals 45.9mN/m 
and ^ equals 5.80 x lO^'^^Nm^. 

Accounting properly for the degeneracy of states, and 
thus for translational and configurational entropies, the 
free energy of a system in the state cr = {L, P, 
becomes (see SI): 

G(a) = ^llLtot+yi°pPtot+n Gdrop (-] -TS{a), (7) 

\n n J 


with the standard chemical potentials /i^ and fi°p of lipids 
and precursors, respectively, and 


S{a) = nkp log 


V 

Vp/e 


+ kp log 




( 8 ) 


being the translational and configurational entropy of the 
system at state a, where V describes the system vol¬ 
ume per droplet and Vp—the molecular volume of the 
precursor—has been chosen as typical volume unit. 

Observing that emulsion droplets of typically lOOnm 
radius have a volume of 0.0040 femtoliter, which— 
assuming a typical water-to-oil ratio of 10 : 1 —gives a sys¬ 
tem volume of 0.044 femtoliter per droplet. This also 
implies that a milliliter of emulsion has an order of mag¬ 
nitude of 10^^ oil droplets. From the ratio of precursor 
to droplet volume, it follows that each droplet contains 
some 7,430,000 molecules plus 2.5% in bulk, totalling 
to about Pc = 7,620,000 anhydride molecules. With 
L*{Pd) and a partition coefficient of 14%, this implies 
a total of Lc = 570,000 surfactant molecules. In other 
words, our emulsion consists of 9.1 volume percent de- 
canoic anhydride and 21 mmol/l decanoic acid. 

Equation Q determines the thermodynamic landscape 
of the system for a given configuration (Ltot, Atot)- Lipids 
and precursors enter and leave aggregates stochastically 
until their association and dissociation reaches detailed 
balance around an equilibrium (A, P, 

So far we have described a system in equilibrium. The 
metabolic reaction that converts supplied precursors into 
additional surfactants drives the systems out of equilib¬ 
rium —see Figure ([^). 

AtotjPtot —> Ltot + v, Ptot — ^- (9) 


where L* is the number of molecules in the droplet at 
optimal packing. In the presence of precursor, (3 and 7 
depend on the size of the core of the aggregate and thus 
the number of precursors, and a delicate competition be¬ 
tween surfactant and precursor determines the coverage 
of either component in the droplet compartment. Assum¬ 
ing a spherical oil core of Pd precursor molecules—each 
with molecular volume Vd = 0.54nm^—the optimal num¬ 
ber of surfactant molecules with tail length ^ = 1.4nm 
and effective head area ag = 25A is given as [34] : 


A*(Pd) 


47r 

Oo 


3Ep 

47r 



1/3 



( 6 ) 


Here, p is the stoichiometric ratio of the metabolic re¬ 
action and equals 1 for precursor esters and 2 for anhy¬ 
drides. We assume that this covalent reaction is essen¬ 
tially irreversible. 

If metabolic turnover operates significantly faster than 
the rearrangement of molecules as well as the fission and 
fusion of aggregates, the dynamics can be reasonably well 
approximated through a separation of time scales. We 
thus consider the overall process as a series of equilib¬ 
rium states under slowly moving boundary conditions. 
However, if a separation of time scales is not possible we 
may use a Fokker-Planck type equation of evolution, as 
described in the SI. A typical trajectory of the Fokker- 
Planck dynamics is shown in Fig 2(a). 
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traction. We realize from the parametrization above that 
only a fraction (p about 4.4% of the provided anhydrides 
are converted into additional surfactants. Thus, in order 
to properly balance the metabolic reaction, feeding must 
proceed at an average rate (fcfeed) = (Ttot)- Alter¬ 

natively, the non-converted fraction of anhydride could 
be replaced by a non-reactive oil of comparable volume 
and hydrophobicity such as eicosane, C 20 H 42 , to decouple 
the cycle turnover time from the reaction speed. 

The newly supplied precursors and surfactants change 
the equilibrium conditions, and the number of aggregates 
of the equilibrium state might change through aggregate 
division. The change on the boundary conditions induced 
by the metabolic reaction can lead the system to the 
following scenario: 


FIG. 2: (Top) feeding-metabolism-extraction cycle of an 

emulsion in (Ltot,Ttot) space (solid black arrows) around 
the calibration point (Lc, Pc) where the system preferentially 
assembles into ten droplets. Dashed red diagonal lines in¬ 
dicate configurations where the partitioning of matter into 
n -t 1 droplets is energetically equal to a partitioning into n 
droplets, thus a single division or fusion event is expected 
each time an arrow crosses a diagonal. Importantly, the num¬ 
ber of metabolically induced divisions is one greater than the 
number of feeding induced fusions. The newly created droplet 
resulting from the net process is extracted, in order to bring 
the system back to its initial condition. (Bottom) Aggregate 
size given by the number of lipids (id) under periodic instan¬ 
taneous feeding (red), compared to continuous feeding (blue). 
Under continuous feeding, the aggregate undergoes a single di¬ 
vision event without any fusion. These numerical simulations 
confirm that a cycle is feasible with realistic parametrisation. 


Life cycles/Thermodynamic cycles.- Our objective is to 
create stable conditions for the system to undergo suc¬ 
cessive thermodynamic cycles. To this end, we provide 
a constant inflow of anhydrides to be used as building 
blocks of new droplets as well as precursors of surfac¬ 
tants through the metabolic reaction—see Figure 
Let us assume a huge reservoir of Lq^Pq total lipids 
and precursors, organized in its (local) equilibrium state 
in n droplets. We feed this reactor from the top with 
APfeed precursors and water, such that it reaches the 
state Li,Pi = Lo,To + APfeed- Within the turnover 
time of the cycle, the reaction will convert a fraction 
AP of the precursors into AL = vAP new surfactants. 
This will drive the system into a new state consisting of 
T 2 ,P 2 = Ti-f AP, Pf —AP lipids and precursors. We 
compensate the inflow of precursors by an outflow that 
constantly removes from the system material propor¬ 
tional to one droplet (containing L 2 /{n-\-l) lipid and 
p 2 /(^ + l) precursor molecules) plus the proportional 
volume of aqueous solution. 

If feeding and metabolism are tuned correctly, extrac¬ 
tion of newly created droplets will compensate the in¬ 
flow of precursors and the initial condition Lq , Pg will be 
recovered—see Figure for the path through state space 
spanned by the actions of feeding, metabolism, and ex- 


G((t„) > G{an+i), 


( 10 ) 


where tT„ and an+i are states of the system with the same 
amount of total molecules, (p 2 ,P 2 ), but differing in the 
number of aggregates from n to n-l-1. If condition (10 1 is 
satisfied, duplication of aggregates is expected to occur. 

At the microscopic level, the creation of new lipids will 
introduce a perturbation to the surface of existing aggre¬ 
gates, whose size will grow until, in some aggregate, the 
frustration due to the geometric term will drive it into an 
unstable state and, presumably, break in two small, more 
stable aggregates—see Figure [T^. Aggregates in this cy¬ 
cle are thus able to create new aggregates by division, 
and we say that aggregates exposed to these conditions 
self-replicate. Numerical simulations shown in Figure 
indicate that cyclic processes of droplet division are ex¬ 
pected under realistic parametrisation. 

These boundary conditions maintain a stationary cy¬ 
cle, driven by the supply of precursors and extraction of 
waste, dissipated through the metabolic turnover into the 
eventual fusion of aggregates. When a new aggregate is 
created, we say that a life cycle has been completed. We 
will refer to such cycle as uj. For the sake of discussion, 
we consider the initial point to be an equilibrium state 
with n aggregates. After feeding and metabolic turnover 
the system rearranges into a new equilibrium state con¬ 
taining n-fl aggregates, one of which subsequently being 
expelled by the boundary conditions to reconstitute the 
initial condition. We recognize the newly created (and 
expelled) aggregate as the outcome of the life cycle. Since 
w is a closed path over a potential function, we have that 


AGfeeding T AGjnetabolisni 4“ AGextraction — 0, (11) 


from which we can derive the energy change associated 
with the replication process as 


AGrep — AGfeeding AGextraction 
= G(L2,-P2)-G(Li,Pi) 

~ AGjiietabolism ~ AGp_^,y 4 ^AF, 


( 12 ) 


which depends on the exact change in chemical poten¬ 
tial AGp^uL = — /Tp associated with the metabolic 
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reaction. For typical anhydride hydrolyses reactions, 
AGp-hjL is on the order of magnitude of — lOkJ/mol, this 
resolves to an order of magnitude of —O.IJ per millilitre 
of produced (extracted) emulsion. 

Since anhydride hydrolysis is an exergonic reaction 
that proceeds spontaneously, the energy change of equa¬ 
tion (121 associated with replication should not be con¬ 
ceptualized as work—of which we would only speak if 
the energy of the downhill reaction would be harvested 
to drive an endergonic reaction against it’s natural di¬ 
rection. Thus, in this situation we encounter a self¬ 
constructing system that spontaneously creates order, 
or “constraints”, by self-assembly from a higher energy 
state [5]. Alternatively, had we used a protocellular 
metabolism that requires external pumping of free en¬ 
ergy, e.g. the photo-fragmentation reaction mm?], the 
necessary photo-energy per produced fatty acid would 
be ~ 2.1eV, which corresponds to ~ 200kJ/mol, and the 
life-cycle could be conceptualised as a work-cycle. Using 
the same assumptions as above, this resolves to an order 
of magnitude of 20 J per millilitre of produced (extracted) 
emulsion. 

Discussion.- The full thermodynamic characterisation 
of a life cycle represents a further step towards the under¬ 
standing life as a physical phenomenon. We have shown 


that, under realistic assumptions, certain chemical sys¬ 
tems are expected to display the onset of biological be¬ 
haviour. Indeed: we have shown the feasibility and phys¬ 
ical consistency of an oil droplet container system to be 
able to grow and reproduce—i.e., to perform a thermo¬ 
dynamic cycle—in accordance with the laws of thermo¬ 
dynamics. The growing process represents the ability of 
the system to take material from the outside to be used 
as building blocks. The replication, driven by energy un¬ 
balances between geometrical configurations, implies the 
possibility of creating a population of aggregates which 
may grow and expand as long as the physical conditions 
are favourable. Our results point to the conception of 
life as an expected emerging phenomena from non-living 
chemical substances under special physical conditions of 
matter gradients and appropriate energy flows. 
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Appendix A 

1. The geometric contribution to the free energy 

In general, the free energy of an aggregate is a three- 
term function depending on its area and the set of pa¬ 
rameters accounting for the membrane properties of the 
system: 


The first term is an ideal gas term and the third term 
is ignored here (or could enter in a re-definition of 82 ). 
Thus we focus on the second B 2 term (the second-virial 
correction) in which the virial coefficient B 2 is expected 
to be of order Cq. This statement can be made more pre¬ 
cise via classical stat. mech. modeling of the surfactant 
layer or as we do now: 

Assuming the above equation of state, the compression 
term in the Gibbs energy will contain terms such as 


Ggeo — 70-1-h 


: (1) {H — HQ)'^da, 
J a 


G. 


compression 


— — J{P(a)da) —> Gidesd+B 2 kBT{L^/a)+.... 


where 7 is the surface tension, /3 the compressibility coef¬ 
ficient, and K the elastic bending modulus. We consider 
that J, /3 ^ K thus we can neglect the contribution of 
the Helfrich Hamiltonian k resulting in a model of 
opposing forces. The minimum of Ggeo is found at: 

d 

-^Ggeo =0; —)■ Oq = 

It turns out that, if we have the area of the head of the 
surfactant molecules, we can compute the ideal coverage 
number of surfactants L* from the ideal area og as: 



and the non-ideal term B 2 kBT{L^/a) will contribute to 
the compressibility K as follows: 

dP dP 

K~^ = ^idLi + 2B2kBT{L/aof 

We may then define B 2 kBT operationally to mean. 


8P 3P 

2 B 2 kBT = ao — {a/Lf ~ ^*^0^ 


and thus fi resolves to: 



where ag is the effective head area of the surfactant 
molecules. Now we interpret the role of parameters /3 
and 7 as depending on the crowding of the core of the 
aggregate, therefore, L* = L*(P). 

Let us assume that we are close to the equilibrium. If 
this is the case, we can compute increases of free energy 
through a Taylor approach: 


/3 


ag dP 
2L*^ da 


(Al) 


To compute L*, let Vp be the volume of a precursor 
molecule and ao,£ be the head area and length of a lipid 
molecule. Then V = VpP is the volume of a sphere 
consisting of P precursor molecules and 



G{a) - G(ao) + 3 ^ 


(a - ao)^ 


*\2 


AG(L) = 


its radius. The radius of the entire aggregate (precursor 
core plus lipid layer) is r + 1. Consequently, the surface 
area of the aggregate is 


a = 47r(r -|- £)^ = 47r 



2 


The compressibility coefficient /3 can be derived from 
the second virial coefficient: assuming that a virial ex¬ 
pansion is appropriate for the 2D surfactant layer that 
covers the aggregate, the equation of state for the 2D 
pressure P will contain terms of the form 

= 1 + B 2 L/a + Bg{L/a)^ + ... 


The ideal number of lipids required to cover the surface 
area a is given by 



2. Equations of Evolution 


Transitions Below we detail all the transitions occurring in our system. A state cr is completely described by five 
variables, cr = {L, as described in equation (|^ of the main text. The free energy of the system in the 
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state cr, G{a) is defined in the equation Q of the main text. Here we adopted the notation G{L, P, n) = G{a), since 
we have the need to explicitly express the changes on the variables. 


(L, P, n)(i^ot.Ptot) 

k'^ n 

-s> 

(L,P,n -1- l)(Ltot,Ptot) 


(L, P,n)(Ljot.Ptot) 

k~n 

- 

P-lP ^)(-^>tot,Aot) ’ ^11 

^-Hg(G(L.P.n-l)-G(L.P.n))//cBT 

(L,P,n)(L^ot.Ptot) 


(L -h l,-P,P)(Ltot,Pt„t) 


(L, P,n)(i^ot.Ptot) 

k~L 

(^- i,P,’^)(Ptot.Ptot);*L = 

fc+g(G(L-l,P.n)-G(L,P,n))/fci5T 

(L, P, n)(Ltot.Ptot) 

fcjPb 

(L,P -h l,n)(Ltot,Pt„t) 


(L, P, n)(i^ot.Ptot) 

k-p^ 

(^,P- i,«)(itot.Ptot);fc? = 

^+g(G(L,P-l,n)-G(P,P,n))/fcT 

(P, P, n)(Ltot,Ptot) 

kraP 
-)■ 

(L -1- 1, P — 1, P)(Ltot + l.Ptot- 

-1) 


The master equation The above described chemical relations can be inserted into a master equation. For the 
sake of readability, we rewrite x = P, Xq = Ptot, y = L, uq = Ttotj z = n, a = (x, y, z, Xq, yo)- In addition, P(cr) = 
P(a;, y, z, xq, yo), and we will use the explicit form to avoid any confusion. We then define j(x, y, z) = G{a)/kBT. The 
continuous version of the Master equation obtained from the above transitions between states equation reads: 


_9 

Ft 


F{x,y,z,xo,yo) 


kn {iz + ^^)P(a:, y, z + Sz, xq, yo) - zP(a;, y, z, xq, yo) + 


(a;, y, z + dzj — ze' 


+fcm {(a; + fe)P(a: + 6x,y- 6y, z, Xq - Sxq, yo + 5yo) - a:P(a:, y, z, Xq, yo)} 

+k^ {(xo - x + 6x)F{x - dx,y,z,xo,yo) - Fo - x)F{x,y, z,xo,yo) + 

+(a; + + Sx, y, xq, yo) - y, z, xq, yo)} 


+kl {iyo-y + i52/)P(a;, y - Sy, z, a:o, yo) - {yo - yF{x, y, z, xq, yo) 

+(y + y + 5y, z, :ro, yo) - ye^(-.?/-^?/)-^(--i')p(x, y, xo, yo)} . 


Expansion of the Master eqnation.- We expand the above master equation up to second order to obtain a 
differential operator accounting for the evolution of the system. We divide the expansion in the three parts corre¬ 
sponding to the coordinates x, y, z. 


Precursors Term - Let us rewrite the x-coordinate of the master equation (we do not explicitly write the dependence 
of the functionals on the other variables, for the sake of readability): 

d 

—P(x) = /cp {(xo — X-|-i5x)P(x — <5x) — (xo — x)P(x)-l- 

+(x + + 5x) - xe^("-'^“)-T'("¥(x)} 

We can expand the exponential up to first order, namely: 

^^(x)-^(x+Sx) ^ ^ + 0{5x^), 

ox 


to obtain 



fcp {(xo — X -I- 5x)P(x — 5x) — (xo — x)P(x)-|- 


-F{x + 6x) 


1 - 

OX 


P(x + 5x) — X 


1 - 

OX 

















Rearraging terms, we have that: 


1 d 

P(x) = xo(P(a; — fa) — Pfx)) + (x + fa)P(a; + fa) — ix — fa)P(a; — fa) + 

/Cp ot 

+(a; + fa)^^^faP(a; + fa) - a;^^faP(a;) 


dx 

Now we expand the differences up to the second order: 

xo(P(x — fa) — P(a;)) = xq 


dx 


o ^2 

-—fa + ;^fa^ + 0 { 6 x^) 
ax ox^ 


{x). 


The second term is more tricky. Indeed, whereas above the standard definition of derivative, 

/(x + fa) - f{x) 


Sx 


= f{x) +0{6x), 


works, now we have a midpoint derivative, i.e.: 

f{x + Sx) — f{x — Sx) 
2Sx 


= f{x) + 0{Sx^). 


The crucial observation is that we already have a second order correction in such a first derivative, thus we only have 
to perform one derivative to have an approach up to the second order of the difference. Therefore: 


(x + (5a;)P(a; + fa) — (x — (5a;)P(a: — Sx) = 2x 


^Sx + OiSx^)\F{x) 


And the remaining term is expanded as follows: 

(x + fa)^|^faP(a; + fa) + a;^^faP(a;) = 


Therefore, we have that 




dx 


2x — xq — X 


^a^fa a fa_^(^ fa3 

OX ax 


dj{ 


dx ) dx 2 dx'^ 


P(a:). 


Lipids Term.- It is easy to see that the y coordinate behaves exactly as the x-coordinate up to a constant. Thus, 






dy J dy 2 dy‘^ 


F{y). 


Number of aggregates term.- z coordinate evolves differently from the x, y ones. The evolution of this coordinate is 
driven by: 


i^|-P( 2 ) = (z- fa)P(z - Sz) - zF{z) + iz + 
dt 


The first term is expanded as: 


{z — fa)P(z — fa) — zF{z) 





P(z), 


and the second one: 





dl{z) 

dz 



d z d"^ 2 


P(0). 
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If we keep terms up to second order, we reach the following Fokker-Planck like equation: 

1 . 


kn dt 


[Z = 


■ dj(z) d 
^ dz Wz^'^d^ 


Metabolic Term - The contribution of the metabolic turnover to the equation of evolution represented by the master 
equation is given by the following difference: 

km {(x + fe)P(x + 5 x,y - Sy,z,xo - Sxo,yo + Syo) - xV{x,y, z,xo,yo)} ■ 

We observe that in this case we explicitly take into account the total number of lipids and precursors, xo,yo- To 
obtain a differential operator accounting for the contribution of the metabolic part to the main equation of evolution, 
we observe that the difference above reported must be approached by a directional derivative^ Vg-. If / : M" —>■ M and 
e € M", the directional derivative of / along the direction given by the vector e is given by: 

V,/ = + aai 

We observe that Vgf = V f ■ e. in our case, n = 5 and we perform the following change of variables for the sake of 
simplicity: x = xi,y = X 2 ,z = x^,xq = X 4 ,yo = x^. Therefore, the expansion will be performed over the following 
difference: 

(xi + i 5 a;i)P(a;i + 6xi,X2 — Sx2,X3,X4 — 6x4, X5 + 6x5) — xif‘{xi,X2,xz,X4,X3). 

X = (xitX 2 ,X 3 ,X 4 ,X 3 ), One can rewrite the above expression in a compressed way, namely: 

(xi + 5xi)P(x + 5gx) — a;iP(af) = a;iVg-P(a;) + 0{6), 

being 

e=(I,-l,0,-I,l). 

From the observation that Vgf = V f ■ e, higher order terms are easily computed, leading to a second order term of 
the expansion like: 


i,j<5 


dxidxj 


Ci and Cj are the zth and jth components of the vector e, respectively. Collecting the first and second order approx¬ 
imations we get the final shape of the differential operator accounting for the role of the metabolism in our system, 
£, namely: 

C = km lxiVg+- 

The Equation of Evolution for the System Collecting all the above derivations, one has that the equation of 
evolution of the system is given by: 


_9 


P((t) = ^u • V -f V • -I- IP(o’). 


V 

Q 2 


k^ 


2x-xq- x—'f{x,y,z) 


Z-+ 


2 j/-2/o - y^iix,y,z) 
dy 


5 


z^l{x,y,z) 


(k~^— k~^— k'^z 

/d^ d_ 
\dx'dy'Wz) 

/ 92 92 92 

\ dx"^ ’ 9y2 ’ dz"^ 


where: 
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and, making the temporary change of notation x = Xi,y = X2, z = X3, xq = X4, yo = X5, for the sake of simplicity, C 
reads: 


C = XiV e 


E 

*d<5 


Xi 


dxidxj ’ 


where Vg- is the directional derivative of the scalar field defined by '¥(xi,X 2 ,x^,X 4 ,x^) along the vector e = 
(1, —1,0, —1,1). CijCj are the ith and jth components of the vector e. We observe that the 5 variables of the 
system are coupled and no reduction of dimension can be performed without making assumptions. Since the time 
scales between metabolism and association-disassociation processes are such that: 

km < 

we can consider that the equilibration of the association-disassociation processes is faster enough to consider that 
metabolism always acts over equilibrated aggregates. This mathematically implies that we can neglect the contribution 
of C, leaving the variation of the total number of molecules -due to the irreversible process of metabolism- as the 
initial conditions of the following Fokker-Planck like equation: 

^P(ct) = (u • V -f V • 52 ) P((t). 


3. Calibration and nnmerical solution 


Decanoic anhydride has a molecular volume of Vp = 
0.54nm^, whilst the hydrocarbon chain of decanoic acid 
has a length of ^ = 1.4nm. We assume an effective 
head area Cq = 25A [21]. With these measurements, we 
can calculate the ideal number L* of surfactants required 
to cover a spherical oil droplet containing Pd anhydride 
molecules assuming perfect packing: 


L*{Pa) 



The measurements imply a surfactant packing parame¬ 
ter of ^ = 0.77 and a mean aggregation number of 
decanoic acid micelles T*(0) resolving to 98 molecules, 
which agrees well with reported values |25j 

As described in the main text, we penalize aggregates 
that deviate from the perfect covering with a harmonic 
expansion around the minimum (zero) energy value: 


AGiL) = G{L*) + ^^{L-L*f 

To determine Gdrop, we need an estimate for the free 
energy change of moving lidpids and precursors from bulk 
into the droplet. For surfactants, this value can be cal¬ 
culated from their partition coefficient— i.e. the fraction 
of lipids found in bulk solution as opposed to the aggre¬ 
gates. Bachman et al. estimate this value to be 14% for 
surfactants with comparable solubility [22] 

= fcBTlog(PL) = /cBTlog(0.14-i - 1). 

At T = 300K, this evaluates to A/rp = 7.52 x 
or 4.53kJ/mol. Since the anhydride has two hydrophobic 



FIG. 3: Langmuir trough measurements of a 0.5mg/ml 

decanoic acid/POPC mixture with 5 mass percent decanoic 
acid. The solid red line indicates the change in pressure upon 
compression. Maximal compression is reached at SS.llcm^ 
with P = 45.9mN/m. At this point, the infinitesimal pres¬ 
sure change dP/da (dotted black line) is 0.147mNm~^/cm^. 


chains, we set A/ip = 2A/iL = 9.06kJ/mol, which in turn 
evaluates to a partition coefficient of 2.5%. 

The parameters j3 and 7 can be deduced from Lang¬ 
muir trough measurements—see figure [^ The surface 
tension parameter 7 is the measured line pressure at 
maximal compression of the lipid monolayer and equals 
45.9mN/m. The compressibility parameter /?, deter¬ 
mined in equation (Al), relates to the slope dP/da at 
maximal compression. In our measurements, dP/da 
resolves to 0.147mNm“^/cm2, leading to /3 = 5.80 x 
lO-'^^Nm^. 


To fix Gsystem, we need to estimate configurational and 
mixing entropy components that are not yet part of the 
above free energies. The mixing entropy depends on the 
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available system volume per oil droplet as 


= nks log 


14 


system. 

Vp/e 


To estimate the system volume, we realize that emul¬ 
sion droplets of typically lOOnm radius have a volume of 
0.0040 femtoliter, which—assuming a water-to-oil ratio 
of 10:1—gives a system volume of 0.044 femtoliter per 
droplet. This also implies that a milliliter of emulsion 
has an order of magnitude of 10^^ oil droplets. 

Finally, to calculate the configurational entropy. 


‘S'config — log 




we conclude from the ratio of precursor to droplet vol¬ 
ume, that the droplet contains some 7,430,000 molecules 
plus 2.5% in bulk, totalling to about Pc = 7,620,000 
anhydride molecules. With L*(Pd) and a partition co¬ 
efficient of 14%, this implies a total of Lc = 570,000 
surfactant molecules. Expressed in concentrations, our 
emulsion consists of 287mmol/l decanoic anhydride and 
21mmol/l decanoic acid. With these numbers, we have 
all information required to explicitly compute the free 
energy Gsystem of a certain volume of emulsion. 

The numbers obtained by this calibration procedure 
consider the system in its relaxed state, where we have 
perfect covering of the droplet surface and perfect parti¬ 
tioning between bulk and aggregate. When constructing 
the non-equilibrium thermodynamic cycle, feeding and 
metabolic turnover will necessarily drive the system out 
of this relaxed state and create tension due to compres¬ 
sion or dilusion of the surfactant layer. In order to intro¬ 
duce the least bias, we design the non-equilibrium ther¬ 
modynamic cycle around our calibration point {Lc,Pc) 
such that the distance to the calibrated values is min- 
imzed. Assuming instantaneous feeding, the life cycle is 
characterized by three points: {Lq,Pq) at the beginning 
of the process, (Li,Pi) just after feeding of APfeed pre¬ 
cursor molecules, (L2) T2) after full metabolic conversion, 
and [L^,P^) = (Lq,Po) after extraction of surplus ma¬ 
terial. In order to obtain a closed cycle, the following 
conditions must hold among these points: 


Li — Lq 


Pi= P 2 + V ^{L2 


L 2 

P 2 


n-\- 1 
n 

n + 1 
n 


Lo 

Po 


Li) 


Thus, an average amount of APfeed = Pi — Po = 
n^o + has to be supplied, of which AP = 

are converted into AL = -Lg surfactants. With the 
above parametrization, this implies that only a fraction 
(j) = Lq/ {vPq + Lq) « 4.4% of the supplied precursor 
is actually converted into lipids. For any given n, these 
relations fix all but two parameters, and we choose the 


points in a way that they span a triangle whose center of 
mass coincides with {Lc,Pc)- 

We are interested in determining the number of stable 
droplets for each pair of (Ttot;-ftot) values around this 
life cycle, or equivalently, within the L, P plane. To this 
end, we sample the phase space along 25 lines of constant 
Ptot ranging from below Pq to above Pi. Along each 
line, we determine the critical amount of Ltot needed, 
such that the minimal free energies G{n)\^^ ^ ^ and 

G{n + j 4 are equal for a given n. This is done 
successively i^or growing n until the critical points fall 
outside the area of observation. By connecting points of 
equal n between different Ptot values, we obtain the bor¬ 
ders between areas of stability for certain droplet num¬ 
bers (shown as dashed lines in Figure ^ with areas la¬ 
belled with the amount of stable droplets). When the 
metabolic process crosses a stability border from left to 
right, this indicates that the surface compression of the 
aggregates is too strong to support the current number 
of droplets and the system responds by rearranging into 
a configuration with one additional droplet. When sta¬ 
bility borders are crossed from bottom to top during the 
feeding process, the surplus oil phase dilutes droplet sur¬ 
faces which in turn fuse into fewer aggregates. The cru¬ 
cial observation is that the metabolic turnover generates 
one aggregate more compared to the state before feed¬ 
ing, and alternatively, that the step from {L 2 ,P 2 ) back 
to {Lo,Pq) crosses one stability line—which corresponds 
to one droplet being expelled due to the boundary con¬ 
ditions. Figure shows an example phase space where n 
has been set to 10, but we have performed calculations 
with n ranging up to 100,000. Calculations for other sys¬ 
tem sizes show essentially the same behavior, which is 
expected as the free energy is an extensive quantity. 

When feeding in smaller batches or continuously in¬ 
stead of instantaneously, the system will traverse the 
state space on a path that lies within the triangle spanned 
by {Lo,Po), {Li,Pi) and {L 2 ,P 2 ). In the extreme case, 
where precursors are supplied at the same rate as they are 
metabolized, the metabolic path {Lq^Pq) —>• (42,^2) co¬ 
incides with the extraction path (42)^2) (Ao,Fo)- As 
can be seen in Figure instantaneous feeding along the 
{Lo,Po) —>■ {Li,Pi) path can induce droplet coalescence 
due to the sudden increase in lipid precursors. Continu¬ 
ous feeding, on the other hand, can prevent droplet fusion 
and leads the system through an uninterrupted process 
of droplet division and extraction. This can be impor¬ 
tant in applications where droplets are decorated with 
surface molecules that should not mix among different 
droplets, such as inheritable carriers of genetic informa¬ 
tion [HllilEe] 

Figure 1^ compares these two feeding strategies in de¬ 
tail: each panel shows the course of some quantity over 
the metabolic reaction path; (Li,Pi) —>• {L 2 ,P 2 ) on the 
left side, and {Lo,Po) —>■ (42,^2) on the right side. The 
individual panels show: (a) total number n of droplets 
for a system calibrated to ten droplets at {Lc,Pc), (b) 
number of precursor molecules in Ptot total and nP^ in 
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droplets, (c) number of lipid molecules Ltot in total and 
nLd in droplets, (d) the deviation L^ — L*{Pd) from ideal 
surface covering of the droplets, (e) the geometric free 
energy contribution, (f) the droplet free energy contribu¬ 
tion, (g) the total entropy of the system, (h) the total free 
energy of the system. The trajectories emphasize that 
continuous feeding prevents fusion of aggregates present 


in the case of instantaneous feeding. Moreover, the re¬ 
sults show that the surface compressibility is so strong in 
relation to other factors, that deviations from the ideal 
covering do practically not occur, and surplus lipids are 
instead found in bulk. During the metabolic turnover, 
chemical energy is mainly used to increase the system 
entropy. 



# molecules # molecules 
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along path (Li,Pi) -> (L 2 ,P 2 ) 


along path {Lq-Po) -> (L 2 ,P 2 ) 
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FIG. 4: System state along the metabolic path assuming either instantaneous feeding (left) or continuous feeding (right). 









































